Generate n-Grams (Terms) (Text Processing)
Synopsis
Creates term n-Grams of tokens in a document.Description
This operator creates term n-Grams of tokens in a document. A term n-Gram is defined as a series of consecutive tokens of length n. The term n-Grams generated by this operator consist of all series of consecutive tokens of length n.
Input
- document
The document port.
Output
- document
The document port.
Parameters
- max_lengthThe maximal length of the n-Grams. Range: